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ABSTRACT 



A hierarchical memory structure includes a directory-based 
main memory coupled to multiple first storage devices, each 
to store data signals retrieved from the main memory. Ones 
of the first storage devices are further respectively coupled 
to second storage devices, each to store data signals 
retrieved from the respectively coupled first storage devices. 
Fetch requests to retrieve data signals are issued by ones of 
the storage devices to the main memory. In response, the 
main memory determines where the most recent data copy 
resides, and issues a return request, if necessary to retrieve 
that copy for the requesting storage device. A speculative 
return generation logic circuit is coupled to at least two of 
the first storage devices to intercept the fetch requests. In 
response to an intercepted request, the speculative return 
generation logic circuit generates a speculative return 
request directly to one or more of the other coupled first 
storage devices. This speculative return request causes any 
updated copies of the requested data signals that may be 
stored at a lower level in the hierarchical memory, to be 
transferred to the first storage device. If a return request for 
the data is then issued by the main memory in response to 
the fetch request, the requested data signals are resident in 
a first storage device, and are readily available to the main 
memory. 

20 Claims, 8 Drawing Sheets 
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SYSTEM AND METHOD FOR PROVIDING 
THE SPECULATIVE RETURN OF CACHED 
DATA WITHIN A HIERARCHICAL MEMORY 
SYSTEM 

CROSS-REFERENCE TO OTHER ^ 
APPLICATIONS 

The following co-pending applications of common 
assignee contain some common disclosure: 

"A Directory -Based Cache Coherency System", filed 
Nov. 05, 1997, Ser. No. 08/965,004, incorporated 
herein by reference in its entirety; 

"High-Speed Memory Storage Unit for a Multiprocessor 
System Having Integrated Directory and Data Storage 
Subsystems", filed Dec. 31, 1997, Ser. No. 09/001,588, 
incorporated herein by reference in its entirety; and 

"Directory -Based Cache Coherency System Supporting 
Multiple Instruction Processor and Input/Output 
Caches", filed Dec. 31, 1997, vSer. No. 09/001,598, 
incorporated herein by reference in its entirety. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

This invention relates generally to an improved hierar- 
chical memory system shared between multiple processors; 
and more particularly, relates to a memory system that' 
performs transfers of cached data between hierarchical lev- 
els of the memory in anticipation of receiving requests to 
retrieve the data, the transfers being performed so that the 
data is more readily available to the requester when the 
anticipated request is received. 

2. Description of the Prior Art 

Data processing systems are becoming increasing com- 
plex. Some systems, such as Symmetric Multi-Processor 35 
(SMP) computer systems, couple two or more Instruction 
Processors (IPs) and multiple Input/Output (I/O) Modules to 
shared memory. This allows the multiple IPs to operate 
simultaneously on the same task, and also allows multiple 
tasks to be performed at the same time to increase system 40 
throughput. As the number of units coupled to a shared 
memory increases, more demands are placed on the memory 
and memory latency increases. To address this problem, 
high-speed local memory systems, including caches and 
high-speed I/O buffer memories, are often coupled to one or 45 
more of the IPs for storing data signals that are copied from 
main memory. These memories are generally capable of 
processing requests faster than the main memory while also 
serving to reduce the number of requests that the main 
memory must handle. This increases system throughput. 50 

While the use of local memories increases system 
throughput, it causes other design challenges. When mul- 
tiple local memories are coupled to a single main memory 
for the purpose of temporarily storing data signals, some 
system must be utilized to ensure that all IPs and I/O 55 
Modules are working from the same (most recent) copy of 
the data. For example, if a copy of a data item is stored, and 
subsequently modified, in a cache memory, another IP 
requesting access to the same data item must be prevented 
from using the older copy of the data item stored either in 60 
main memory or the requesting IP's cache. This is referred 
to as maintaining cache coherency. Maintaining cache 
coherency becomes more difiBcult as more caches are added 
to the system since more copies of a single data item may 
have to be tracked. 65 

Many methods exist to maintain cache coherency. Some 
earlier systems achieve coherency by implementing memory 
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locks. That is, if an updated copy of data exists within a local 
cache or buffer memory, other processors are prohibited 
from obtaining a copy of the data from main memory until 
the updated copy is returned to main memory, thereby 
releasing the lock. For complex systems, the additional 
hardware and/or operating time required for setting and 
releasing the locks within main memory cannot be justified. 
Furthermore, reliance on such locks directly prohibits cer- 
tain types of applications such as parallel processing. 

Another method of maintaining cache coherency is shown 
in U.S. Pat. No. 4,843,542 issued to Dashiell et al., and in 
U.S. Pat. No. 4,755,930 issued to Wilson, Jr. et al These 
patents discuss a system wherein each processor has a local 
cache coupled to a shared memory through a common 
memory bus. Each processor is responsible for monitoring, 
or "snooping", the common bus to maintain currency of its 
own cache data. These snooping protocols increase proces- 
sor overhead, and are unworkable in hierarchical memory 
configurations that do not have a common bus structure. A 
similar snooping protocol is shown in U.S. Pat. No, 5,025, 
365 to Mathur et al., which teaches a snooping protocol that 
seeks to minimize snooping overhead by invalidating data 
within the local caches at times when other types of cache 
operations are not occurring. However, the Mathur system 
can not be implemented in memory systems that do not have 
a common bus structure. 

Another method of maintaining cache coherency is shown 
in U.S. Pat, No. 5,423,016 to Tsuchiya assigned to the 
assignee of the current invention. The method described in 
this patent involves providing a memory structure called a 
"duplicate tag" that is associated with each cache memory. 
Each duplicate tag records which data items are stored 
within the associated cache. When a data item is modified by 
a processor, an invalidation request is routed to all of the 
other duplicate tags in the system. The duplicate tags are 
searched for the address of the referenced data item. If 
found, the data item is marked as invalid in the other caches. 
Such an approach is impractical for distributed systems 
having many caches interconnected in a hierarchical fashion 
because the time required to route the invaHdation requests 
poses an undue overhead. 

For distributed systems having hierarchical memory 
structures, a directory -based coherency system becomes 
more practical. Directory -based coherency systems utilize a 
centralized directory to record the location and the status of 
data as it exists throughout the system. For example, the 
directory records which caches have a copy of the data, and 
further records if any of the caches have an updated copy of 
the data. When a cache makes a request to main memory for 
a data item, the central directory is consulted to determine 
where the most recent copy of that data item resides. Based 
on this information, the most recent copy of the data is 
retrieved so that it may be provided to the requesting cache. 
The central directory is then updated to reflect the new status 
for that unit of memory. A novel directory-based cache 
coherency system for use with multiple Instruction Proces- 
sors coupled to a hierarchical cache structure is described in 
the CO -pending application entitled "Directory -Based Cache 
Coherency System Supporting Multiple Instruction Proces- 
sor and Input/Output Caches" referenced above and which is 
incorporated herein by reference in its entirety. 

The use of the a fore -mentioned directory-based cache 
coherency system provides an efficient mechanism for shar- 
ing data between multiple processors that are coupled to a 
distributed, hierarchical memory structure. Using such a 
system, the memory structure may be incrementally 
expanded to include any multiple levels of cache memory 
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while still maintaining the coherency of the shared data. As are not all interconnected by a common bus structure, 

the number of levels of hierarchy in the memory system is obtaining an exclusive copy of data that can be utilized by 

increased, however, some efficiency is lost when data a processor for update purposes may be time-consuming. As 

requested by one cache memory in the system must be the number of these so-called "ownership" requests for 

retrieved from another cache. 5 obtaining an exclusively "owned" data copy increases 

As an example of performance degradation associated within the system, throughput may decrease. TTiis Ls espe- 

with memory requests in a hierarchical cache memory cially true as additional levels of hierarchy are included in 

system, consider a system having a main memory coupled to the memory structure. 

three hierarchical levels of cache memory. In the exemplary Qne mechanism for increasing throughput involves pro- 
system, multiple third-level caches are coupled to the main J, ^i^i ^ high-speed data return path within the main 
memory, multiple second-level caches are coupled to each ^ ^^^^ ^^^^ ^^^^^^^ ^^^^ ^ ^^^^^^ ^ 
hird-level cache, and at leastone first-level cache IS c^^^^^ high-speed interface forwards the data directly to the 
to each second-level cache. This exemplary system includes f -.u . *u j . v * a ■ 

. , . . ' u T-u- *u * u J 4 requester without the need to perform any type 01 main 

a non-inclusive caching scheme. This means that all data ^ all j - * ^ t /u- f u 

,. 01,.- . -1 j-.u memory access. A high-speed interface of this type can be 

stored in a first-level cache is not necessanly stored in the ^ , , . u .u j-c j j j-^i j j * u . 

- , 1 . J 11 J . . J • 15 used to route both modified and unmodified data between 

interconnected second-level cache, and all data stored in a . • *t. ♦ o u * • j u ^ 

J , , . . ^ 1 . J ' 1 J the vanous units m the system. Such a system is described 

second-level cache is not necessarily stored m the coupled • .i_ n o * . i- j up * j a* *u ^ 

third level cache ^ m the U.S. patent apphcation entitled System and Method 

„ ' . , ' ^ , for By-Passing Supervisory Memory Intervention for Data 

Withm the above-described system, one or more proces- Transfers Between Devices Having Local Memories", Pat. 

sors are respectively coupled to make memory requests to an 6,167,489, issued Dec. 26, 2000. While this tvpe of 

associated first -level cache. Requests for data items not -^^^^^^^^ decreases the time required to complete the data 

resident in the first-level cache are forwarded on to the ^^^^^^ operation, latency is still imposed by the length of the 

intercoupled second-level, and m some cases the third- eve ^^^^ ^^^^^^ ^^-^^ ^^^^^^ ^^^^ j^^^^^ 1^^^^^ 

caches. If neither of the intercoupled second or third level ^ hierarchy, to main memory, and back to the lowest 

caches stores the requested data, the request is forwarded to ^^^^^^ j^^^^^ ^^^^^^^ therefore, is a system that 

mam memory. minimizes the time required to return data to a requesting 

Assume that in the current example, a processor makes a processor coupled to the hierarchical memory svstem by 

request to the intercoupled first-level cache for a read-only shortening the data return path, 

copy of specified data. Assume further that the requested Objects* 

data is not stored in this first-level cache. However, another 30 -phe primary object of the invention is to provide an 

first-level cache within the system stores a read-only copy of improved shared memory system for a multiprocessor data 

the data. Since the copy of the data is read-only, the request processing system* 

can be completed without involving the other first-level ^ f^^her object is to provide a hierarchical, directory- 
cache. That is, the request may be processed by one of the ^^^^ ^^^^^^ memory system having improved response 
interconnected second or third-level caches, or if neither of times* 

these caches has a copy of the data, by the main memory. ^ f^irther object is to provide a system for use with a 

In addition to requests for read-only copies of data, hierarchical memory that transfers data up the hierarchical 

requests may be made to obtain "exclusive" copies of data memory structure in anticipation of receipt of a request to 

that can be updated by the requesting processor. In these provide the data to the highest level in the memory hierar- 

situations, any previously cached copies of the data must be 40 chy; 

marked as invalid before the request can be granted to the Another object is to provide a system that allows modified 

requesting cache. That is, in these instances, copies of the data residing in first and second-level cache memories to be 

data may not be shared among multiple caches. This is provided to an associated third-level cache memory in 

necessary so that there is only one "most-current" copy of anticipation of the third-level cache memory receiving a 

the data existing in the system and no processor is working 45 request to transfer the data to a main memory; 

from outdated data. Returning to the current example, a yet further object is to provide a system that generates 

assume the request to the first-level cache is for an exclusive speculative return requests requesting the transfer of data 

copy of data. This request must be passed via the cache between first and second storage devices included within a 

hierarchy to the main memory. The main memory forwards hierarchical memory system so that an anticipated fetch 

this request back down the hierarchical memory structure to 50 operation for the data can be completed more quickly; 

the first-level cache that stores the requested data. This a still further object is to provide a hierarchical memory 

first-level cache must invalidate its stored copy of the data, system that allows speculative return requests that are pend- 

indicating that this copy may no longer be used. If this ing to a cache memory to be discarded after the main 

first-level cache had an exclusive copy of the data, and had memory issues a request for the data that is associated with 

further modified the data, the modified data is passed back 55 the speculative return request; 

to the main memory to be stored in the main memory and to Another object is to allow a cache memory to probe one 

be forwarded on to the requesting first-level cache. In this or more associated cache memories to determine the pres- 

manner, the requesting cache is provided with an exclusive ence of updated data in anticipation of receiving a request 

copy of the most recent data. for the data; 

The steps outlined above with respect to the exclusive 60 A still further object is to allow a first cache memory to 

data request are similar to those that must be executed if a provide requests to one or more associated cache memories 

read-only copy of the data is requested when a copy of the requesting invalidation of predetermined data that may 

requested data resides exclusively in another cache. '^The potentially reside within the associated cache memories in 

previous exclusive owner must forward a copy of the preparation for possible receipt by the first cache memory of 

updated data to main memory to be returned to the requester. 65 a request for that predetermined data; 

As may be seen from the current example, in a hierar- Another object is to allow a first memory to provide 

chical memory system having multiple levels of cache that requests to one or more associated memories requesting that 
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a shared copy ot daia potcmially residing within one or more 
of the associated memories be provided to the first memory 
in preparation for possible receipt by that first memory of a 
request for a shared data copy; 

Yet another object is to allow a first cache memory to 5 
provide requests to one or more associated cache memories 
requesting that an exclusive copy of data that may poten- 
tially reside within the associated cache memories be pro- 
vided to the first cache memory in preparation for possible 
receipt by the first cache memory of a request for an lo 
exclusive data copy; and 

Still another object is to provide a system that allows 
predetermined fetch requests issued within a data processing 
system to generate requests to transfer the requested data 
between various memory resources even before it is known 15 
where the latest copy of the data resides. 

SUMMARY OF THE INVENTION 

l^e objectives of the present invention are achieved in a 
speculative return system that generates requests to transfer 20 
data between one or more levels within a hierarchical 
memory structure in anticipation of receiving a request for 
the data, llie hierarchical memory structure includes a main 
memory coupled to multiple first storage devices, each of 
which stores data signals retrieved firom the main memory. 25 
Ones of the first storage devices are further respectively 
coupled to second storage devices, each of which stores data 
signals retrieved from the respectively coupled first storage 
devices. In the preferred embodiment, the first and second 
storage devices are cache memories, and the main memory 30 
is a directory-based memory that includes a directory to 
indicate which of the other memories is storing a copy of 
addressable portions of the memory data. 

According to the coherency scheme of the hierarchical 
memory structure, each of the first storage devices is capable 35 
of generating a fetch request to the main memory to obtain 
a copy of requested ones of the data signals. In some 
instances, the main memory does not store the latest copy of 
the requested data signals, as will be indicated by corre- 
sponding status signals stored in the directory memory. 40 
When this occurs, the main memory issues a return request 
to cause a target one of the first storage devices to return the 
latest copy of the requested data signals to the main memory 
so these signals can be forwarded to the requesting storage 
device. In some cases, however, the target one of the first 45 
storage devices, has, in turn, provided the requested data 
signals to one or more of the respectively coupled second 
storage devices. Additional storage devices may be further 
coupled to these second storage devices for storing data 
copies. Thus, the data signals must be transferred up the 50 
hierarchical memory structure, from the storage devices at 
the lowest level in the memory hierarchy to the target 
storage device, and finally to the main memory. 'ITiis 
imposes latency. 

The speculative return system of the current invention 55 
decreases the lime required for the main memory to retrieve 
data signals stored in a lower level in the hierarchical 
memory system. The speculative return system includes at 
least one speculative return generation logic circuit coupled 
to at least two of the first storage devices. The speculative 60 
return generation logic circuit intercepts fetch requests gen- 
erated by any of the coupled first storage devices. In 
response thereto, the speculative return generation logic 
circuit generates a speculative return request to one or more 
of the other coupled first storage devices. The speculative 65 
return request causes these first storage devices to prepare to 
send any stored, updated copy of the requested data signals 
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to main memory. This includes retrieving any updated 
copies of the requested data signals that may be stored at a 
lower level in the hierarchical memory structure, including 
those copies stored in the respectively coupled second 
storage devices. 

While any stored copies of the requested data signals are 
being retrieved in response to the speculative return request, 
the original fetch request is received by the main memory. 
In response thereto, the main memory may generate a return 
request to a target one of the first storage devices to return 
the latest copy of the requested data signals. If the target one 
of the first storage devices is one of the one or more storage 
devices that has already executed the speculative return 
request, the requested data signals are already resident in the 
target storage device upon receipt of the return request. 
These data signals may therefore be provided immediately 
by the target storage device to the main memory so they can 
be forwarded to the requesting storage device. This 
decreases memory latency. 

In the current hierarchical memory system, various types 
of fetch requests may be generated to the main memory. 
According to one aspect of the speculative return generation 
system, a speculative return request is generated only in 
response to the receipt of predetermined types of fetch 
requests. For example, in the preferred embodiment, some 
fetch requests are associated with the retrieval of an exclu- 
sive data copy, whereas other fetch requests initiate the 
retrieval of a read-only data copy. Still other types of fetches 
are conditional fetches that trigger the execution of a pre- 
diction algorithm to determine whether an exclusive, versus 
a read-only copy, will be retrieved. The current speculative 
return generation system generates speculative return 
requests for exclusive-copy fetches and some conditional 
fetches. This design choice is made to minimize the unnec- 
essary transfer of data signals within the hierarchical 
memory when it is likely that the read-only, shared data copy 
is already available from the main memory. 

According to another aspect of the invention, several 
types of speculative return requests may be generated 
depending on the type of fetch request that is issued. In the 
preferred embodiment, a fetch request that is requesting an 
exclusive data copy initiates a predetermined type of specu- 
lative return request that purges any stored data copy from 
the lower levels in the memory. Alternatively, a fetch request 
requesting a shared, read-only data copy initiates a specu- 
lative return request that allows lower memory levels to 
retain a shared, read-only data copy while returning a 
read-only copy to a respective one of the first storage 
devices. 

The current speculative return system includes logic to 
temporarily store speculative return requests, if necessary, 
prior to providing those requests to a respectively-coupled 
one of the first storage devices for processing in an order 
determined by a predetermined priority scheme. The specu- 
lative return generation system is further coupled to receive 
from the main memory all return requests that are generated 
to any of the respectively-coupled ones of the first storage 
devices. If a return request is received that was initiated by 
the same fetch request that initiated a still-pending specu- 
lative return request, the speculative return request is dis- 
carded. The speculative return request is not needed in this 
instance since the transfer of data from the lower to the 
higher levels of the memory is accomplished via execution 
of the return request itself 

In one embodiment of the invention, the first storage 
devices are each associated with a tag memory. This tag 
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memory stores status signals descriptive of the data signals ncction referred to as an MSU Interface (MI) in dashed 

stored in the associated first storage device, and in additional block 130, individually shown as 130A through 130S. For 

ones of the storage devices coupled to the associated first example, MI 130A interfaces POD 120A to MSU llOA, MI 

storage device at a lower level of the memory hierarchy. 130B interfaces POD 120A to MSU HOB, MI 130C intcr- 

Thcse status signals describe both the location and type of 5 faces POD 120A to MSU HOC, MI 130D interfaces POD 

any copies of the data signals residing in these storage 120A to MSU llOD, and so on. 

structures. Speculative return requests issued to first storage In one embodiment of the present invention, MI 130 

devices initiate the return of data signals from lower levels comprises separate bi-directional data and bi-directional 

in the memory hierarchy only if the status signals in the tag address/command interconnections, and further includes 

memory indicate that a predetermined type of data copy lo unidirectional control lines that control the operation on the 

exists for the requested data signals. In the preferred data and address/command interconnections (not individu- 

embodiraent, this data transfer occurs only if an exclusive, ally shown). The control lines run at system clock frequency 

read/write copy of the data signals is resident in the lower (SYSCLK) while the data bus runs source synchronous at 

memory levels. This design choice is made to optimize two times the system clock frequency (2xSYSCLK). 

memory efficiency. 15 Any POD 120 has direct access to data in any MSU 110 

Still other objects and advantages of the present invention via one of Mis 130. For example, MI 130A allows POD 

will become readily apparent to those skilled in the art from 120A direct access to MSU llOA and MI 130F allows POD 

the following detailed description of the preferred embodi- i20B direct access to MSU HOB. PODs 120 and MSUs 110 

ment and the drawings, wherein only the preferred embodi- are discussed in further detail below, 

ment of the invention is shown, simply by way of illustration 20 System Platform 100 further comprises Input/Output 

of the best mode contemplated for carrying out the inven- (W) Modules in dashed block 140 individually shown as 

tion. As will be reaUzed, the invention is capable of other W Modules 140A through 140H, which provide the inter- 

and different embodiments, and its several details are face between various Input/Output devices and one of the 

capable of modifications in various respects, all without PODs 120. Each I/O Module 140 is connected to one of the 

departing from the invention. Accordingly, the drawings and 25 PODs across a dedicated point-to-point connection called 

description are to be regarded to the extent of applicable law the MIO Interface in dashed block 150 individually shown 

as illustrative in nature and not as restrictive. as 150A through 150H. For example, I/O Module 140A is 

connected to POD 120A via a dedicated point-to-point MIO 

BRIEF DESCRIPTION OF THE FIGURES Interface 150A. The MIO Interfaces 150 are similar to the 

^ , , ^ 30 MI Interfaces 130, but in the preferred embodiment have a 

Tlie present mvention will be described with reference to ^^^^^^^ ^^^^ ^^^^ approximately half the transfer rate of the 

the accompanymg drawmgs. Interfaces because the I/O Modules 140 are located at a 

FIG. 1 is a block diagram of a Symmetrical MultiProces- greater distance from the PODs 120 than are the MSUs 110. 

sor (SMP) system platform according to a preferred embodi- Processing Module (POD); 

ment of the present invention; 35 piG. 2 is a block diagram of a processing module (POD) 

FIG. 2 is a block diagram of a Processing Module; according to one embodiment of the present invention. POD 

FIG. 3 is a block diagram of the Sub-Processing Module; 120A is shown, but each of the PODs 120A through 120D 

FIG. 4 is a block diagram of the TCM of the preferred ^^^^^ ^ similar configuration. POD 120A includes two Sub- 

embodiraenf Processing Modules (Sub-PODs) 210A and 210B. Each of 

^. ' . ^ 40 the Sub-PODs 210A and 210B are interconnected to a 

FIG. 5 IS a block diagram of Command/Function Routing ^^^^^ ^^^^^^ ^^^^^ 220 through dedicated point-to- 

^S^^' point Sub-POD Interfaces 230A and 230B, respectively, that 

FIG. 6 is block diagram of the Third Level Cache; ^re similar to the MI interconnections 130. TCM 220 further 

FIG. 7 is a block diagram illustrating the format of interconnects to one or more I/O Modules 140 via the 

requests as provided by the TCM to the MSU; 45 respective point-to-point MIO Interfaces 150. TCM 220 

FIG. 8 is a block diagram illustrating the format of both buffers data and functions as a switch between the 

requests provided by the MSU to the TCM; Sub-POD Interfaces 230A and 230B, the MIO Interfaces 

FIG. 9 is a table summarizing the types of Speculative ISOAand 150B, and the MI Interfaces 130A through 130D. 

Return Functions that are generated by the TCM in response When an I/O Module 140 or a Sub-POD 210 is intercon- 

to receiving various ones of the Fetch commands from a 50 nected to one of the MSUs via the TCM 220, the MSU 

Sub-POD; and connection is determined by the address provided by the I/O 

RG. 10 is a block diagram of the Speculative Return ^"^^^^ ^^^^ Sub-POD, respectively. In general, the TCM 

Generation Logic maii^ one-fourth of the memory address space to each of the 

MSUs IIOA-IIOD. According to one embodiment of the 

DETAILED DESCRIPTION OF THE 55 current system platform, the. TCM 220 can further be 

PREFERRED EMBODIMENTS configured to perform address interleaving functions to the 

System Platform: various MSUs. The TCM may also be utilized to perform 

FIG. 1 is a block diagram of a Symmetrical Multi- address translation functions that are necessary for ensuring 

Processor (SMP) System Platform according to a preferred that each processor (not shown in FIG. 2) within each of the 

embodiment of the present invention. System Platform 100 60 Sub-PODs 210 and each I/O Module 140 views memory as 

includes one or more Memory Storage Units (MSUs) in existing within a contiguous address space as is required by 

dashed block HO individually shown as MSU llOA, MSU certain off-the-shelf operating systems, 

HOB, MSU lie and MSU lOlD, and one or more Process- In one embodiment of the present invention, I/O Modules 

ing Modules (PODs) in dashed block 120 individually 140 are external to Sub-POD 210 as shown in FIG. 2. This 

shown as POD 120A, POD 120B, POD 120C, and POD 65 embodiment allows System Platform 100 to be configured 

120D. Each unit in MSU HO is interfaced to all PODs 120A, based on the number of I/O devices used in a particular 

120B, 120C, and 120D via a dedicated, point-to-point con- application. In another embodiment of the present invention. 
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